Mining Spatial Association Rules with Geostatistics

نویسندگان

  • Jiangping Chen
  • Xiaojin Tan
چکیده

In 1962, G. Matheron introduced the term geostatistics to describe a scientific approach to evaluate problems in geology and mining, from ore reserve estimation to grade control. Geostatistics provides statistical methods used to describe spatial relationships among sample data and to apply this analysis to the prediction of spatial and temporal phenomena. They are used to explain spatial patterns and to interpolate values at unsampled locations. Geostatistics have traditionally been used in the sphere of geosciences: meteorology, mining, soil science, forestry, fisheries, remote sensing, and cartography. It later were successfully applied to economics, health, and other disciplines. Currently, it’s a trend to integrate powerful methods of geostaitsitcs into a geographic information system (GIS). This paper put forward a new algorithm of mining association rules with geostatistics in analyzing the epidemic problem. A key feature of epidemic data is their location in a space-time continuum. Geostatistics is independent of mean variance relationship and therefore can be used to verify more traditional methods of evaluation inner spatial structure. During structural analysis, spatial autocorrelation can be analyzed using covariance and semivariogram. With structural analysis predictions at unsampled locations can be made using geostatistic method such as kriging (i.e. multiple linear regression in a spatial context). Geostatistical analysis can interpret statistical distributions of data and also examine spatial relationships. It is capable of revealing how cohesion values vary over distance, and of predicting areas of high and low cohesion values. The geostatistics software provides tools for capturing maximum information on a phenomenon from sparse, often biased, and often under-sampled data. It is a good method for spatial data mining by taking account of the autocorrelation between the spatial data. In this paper, the first step is to use the geostatistics methods such as kriging, Spatial Autoregressive Model (SAR) to analyse and estimate the correlation of the land use/cover change and hay fever incidence. Then build a spatial autocorrelation model and then use the model to mining the spatial association rules. We can get the spatial frequency items from the autocorrelation Model. This replaces the repeated scanning of the spatial database by the measure of conventional spatial association rules mining. From the result of the example, the method is more quick and efficient than the traditional data mining algorithm Apriori.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Exploring the Relationships between Spatial and Demographic Parameters and Urban Water Consumption in Esfahan Using Association Rule Mining

In recent years, Iran has faced serious water scarcity and excessive use of water resources. Therefore, exploring the pattern of urban water consumption and the relationships between geographic and demographic parameters and water usage is an important requirement for effective management of water resources. In this study, association rule mining has been used to analyze the data of municipal w...

متن کامل

Introducing an algorithm for use to hide sensitive association rules through perturb technique

Due to the rapid growth of data mining technology, obtaining private data on users through this technology becomes easier. Association Rules Mining is one of the data mining techniques to extract useful patterns in the form of association rules. One of the main problems in applying this technique on databases is the disclosure of sensitive data by endangering security and privacy. Hiding the as...

متن کامل

Spectroscopic Based Quantitative Mapping of Contaminant Elements in Dumped Soils of a Copper Mine

Possibility of mapping the distribution of Arsenic and Chromium in a mining area was investigated using combination of (VNIR) reflectance spectroscopy and geostatistical analysis. Fifty five soil samples were gathered from a waste dump at Sarcheshmeh copper mine and VNIR reflectance spectra were measured in a laboratory. Savitzky- Golay first derivative was used as the main pre-processing metho...

متن کامل

Using a Data Mining Tool and FP-Growth Algorithm Application for Extraction of the Rules in two Different Dataset (TECHNICAL NOTE)

In this paper, we want to improve association rules in order to be used in recommenders. Recommender systems present a method to create the personalized offers. One of the most important types of recommender systems is the collaborative filtering that deals with data mining in user information and offering them the appropriate item. Among the data mining methods, finding frequent item sets and ...

متن کامل

Mining Spatial Gene Expression Data Using Negative Association Rules

Over the years, data mining has attracted most of the attention from the research community. The researchers attempt to develop faster, more scalable algorithms to navigate over the ever increasing volumes of spatial gene expression data in search of meaningful patterns. Association rules are a data mining technique that tries to identify intrinsic patterns in spatial gene expression data. It h...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008